Genomic Prediction Accounting for Residual Heteroskedasticity
نویسندگان
چکیده
Whole-genome prediction (WGP) models that use single-nucleotide polymorphism marker information to predict genetic merit of animals and plants typically assume homogeneous residual variance. However, variability is often heterogeneous across agricultural production systems and may subsequently bias WGP-based inferences. This study extends classical WGP models based on normality, heavy-tailed specifications and variable selection to explicitly account for environmentally-driven residual heteroskedasticity under a hierarchical Bayesian mixed-models framework. WGP models assuming homogeneous or heterogeneous residual variances were fitted to training data generated under simulation scenarios reflecting a gradient of increasing heteroskedasticity. Model fit was based on pseudo-Bayes factors and also on prediction accuracy of genomic breeding values computed on a validation data subset one generation removed from the simulated training dataset. Homogeneous vs. heterogeneous residual variance WGP models were also fitted to two quantitative traits, namely 45-min postmortem carcass temperature and loin muscle pH, recorded in a swine resource population dataset prescreened for high and mild residual heteroskedasticity, respectively. Fit of competing WGP models was compared using pseudo-Bayes factors. Predictive ability, defined as the correlation between predicted and observed phenotypes in validation sets of a five-fold cross-validation was also computed. Heteroskedastic error WGP models showed improved model fit and enhanced prediction accuracy compared to homoskedastic error WGP models although the magnitude of the improvement was small (less than two percentage points net gain in prediction accuracy). Nevertheless, accounting for residual heteroskedasticity did improve accuracy of selection, especially on individuals of extreme genetic merit.
منابع مشابه
Multiple-breed genetic inference using heavy-tailed structural models for heterogeneous residual variances.
Multiple-breed genetic models recently have been demonstrated to account for the heterogenous genetic variances that exist between different beef cattle breed groups. We extend these models to allow for residual heteroskedasticity (heterogeneous residual variances), specified as a function of fixed effects (e.g., sex, breed proportion, breed group heterozygosity) and random effects such as cont...
متن کاملAccounting for outliers and heteroskedasticity in multibreed genetic evaluations of postweaning gain of Nelore-Hereford cattle.
The objectives of this study were to demonstrate the utility of hierarchical Bayesian models combining residual heteroskedasticity with robustness for outlier detection and muting and to evaluate the effects of such joint modeling in multibreed genetic evaluations. A 3 x 2 factorial specification of 6 residual variance models based on several distributional (Gaussian, Student's t, or Slash) and...
متن کاملShould the Markers on X Chromosome be Used for Genomic Prediction?
This study investigated the accuracy of imputation from LD (7K) to 54K panel and compared accuracy of genomic prediction with or without the X chromosome information, based on data of Nordic Holstein bulls. Beagle and Findhap were used for imputation. Averaged over two imputation datasets, the allele correct rates of imputation using Findhap were 98.2% for autosomal markers, 89.7% for markers o...
متن کاملA General Affine Earnings Valuation Model
We introduce a methodology, with two applications, that incorporates stochastic interest rates, heteroskedasticity and risk aversion into the residual income model. In the first application, goodwill is an affine (constant plus linear term) function where the constant and linear coefficients are time-varying. Homoskedastic risk gives rise to a constant risk premium, while heteroskedastic risk g...
متن کاملResidual life prediction from statistical features and a GARCH modeling approach for aircraft generators
Condition-based maintenance is currently widely used in the aviation industry with diagnoses obtained from the performance data of the aircraft. Online assessments of the real-time condition and predicted residual life have been of great importance for both mechanics and pilots, especially during flight for the latter. Statistical distribution and feature parameters are believed to be crucial c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 6 شماره
صفحات -
تاریخ انتشار 2015